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@ On-the-fly error correction with embedded digital controller. 

(57) An on-the-fly error correction method for conrecting a data block within a stream of substantially 
contiguous data blocks during a data decoding process, each block having error syndrome information 
calculated and appended in accordance with a predetermined Reed Solomon code during an encoding 
process occurring before the data decoding process, comprising the steps of : 

passing the block of data through a syndrome Information recovery circuit to obtain recovered 
information related to the error syndrome information, 

comparing the recovered information with zero to determine if an enror burst is present within the 
"" 'data block; and ~ ~~ ~ ~ _ - — 

if an error burst is determined to be present : 

latching the recovered information into a remainder latch in order to free the syndrome 

information recovery circuit for the next data block of the stream, 

causing a microcontroller to call and execute an error correction service routine and thereupon : 
selectively transferring bytes of the recovered information from the remainder latch to the 

microcontroller, 

determining with the microcontroller the location and a corrected value for the error burst in 
accordance with a predetermined Reed Solomon enror correction algorithm implemented within the 
microcontroller by the service routine, and 

substituting the corrected value for the error burst within the block detected to contain the error 
burst before it is delivered to a host system. Cross checking the correction, and error detection of data 
block ID fields is also disclosed. 
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Field of the Invention 

The present invention relates to methods and apparatus for detecting and conrecting byte errors within a 
data stream. More particularly, the present invention relates to methods and apparatus for detecting and correc- 
5 ting byte errors within data blocks, such as data blocks read out from a storage surface within a rotating disk 
data storage subsystem, wherein the subsystem includes a programmed digital controller. 

Background of the Invention 

10 Error correction techniques are known in digital data processing systems, Including memory stores such 

as disk drive data storage subsystems and tape drive data storage subsystems, and they are known in digital 
data communications systems. A methodology is selected and implemented which generates syndrome infor- 
mation from substantially contiguous blocks, sets or subsets of a digital data stream. By "contiguous** is meant 
that the blocks occur one immediately after another, with minimum interval or time lapse between or separating 

15 the blocks. The syndrome information is appended to the block, set or subset at an encoder end, typically within 
a data writing or transmission process. At a data reading or reception process, the received error correction 
syndrome information, called herein the "remainder", is obtained from the received data block and compared 
with zero. If there are any non-zero remainder values or bytes, an error is determined to be present Depending 
upon the nature and amount of appended syndrome information, it is possible to detect and correct one or more 

20 error bursts in the data block. 

An error burst is a set of adjacent bit positions which are in error, A single error burst is one occurrence of 
such erroneous bit positions within a data block, and can extend over the interleaves thereof. A double error 
burst is two separate occurrences of error burst within the data block, for example. Error conrection systeros 
and strategies are typically rated upon their ability to locate and correct single burst and double burst errors. 

25 as well as the speed of execution and a low probability of miscorrection of error bursts by the error conrection 
strategy. 'f 
In the prior art approach, a syndrome converter is typically implemented as a shift register with taps ariS 
with external multiplier elements. In an external fonm, a summer combined multiplier tenms from the shift registieir 
with the incoming data stream in order to generate the syndrome. In commonly assigned U.S. Patent N^t 

30 4.730.321. to Machado. the disclosure of which is hereby incorporated by reference, an internal form was used^ 
in which the shift register Includes the summing nodes, in order to penmit logical elements to be shared and 
thereby minimize the number of structural elements required to implement the error correction hardware. 

Many error correction code schemes employed within disk drives have employed a Galois field of one bit 
(GF(2i]. In a one bit implementation, the multiplier terms are therefore either present or not present, and the 

35 summing node is implemented as an Exclusive OR gate. Several prior implementations have gone to a Galois 
Field byte basis (i.e., eight bits per symbol), as was done in the referenced U.S. Patent No. 4,730,321 to 
Machado. In those prior approaches, the syndrome generator received eight bits in and put eight bits out. The 
referenced Machado '321 patent employed a Galois field in which the primitive element aipha^ tenm is 2B (Hex) 
in order to reduce the number of gates needed to implement the syndrome generator circuitry, 

40 U.S. Patent No. 4,730,321 to Machado Is considered to be the closest prior art and is acknowledged in the 

precharacterising portion of claim 1 . 

The general problem with a classic error correction approach, whether bit/input to bit/output or a byte 
input/byte output, is that when one or more errors are detected within a particular block, the system data stream 
has to be shut down during the interval required to determine the location and correct the burst error or enrors 

45 within the block determined to include the error. 

Particulariy in disk drive data storage subsystems, it is now commonplace to encounter one or more pro- 
grammed monolithic digital microcontrollers. These microcontrol elements are very powerful in carrying out 
operations and calculations appropriate to supervise data transducer head positioning and track following ope- 
rations, and to supervise transfers of data, commands and status values with the host computing system, on 

50 an as needed basis. 

Since enrors and consequent error correction processes occur relatively infrequently in well designed and 
manufactured disk drives, the calculational power of the microcontroller is best adapted to the task of performing 
error location and correction operations on an as needed, interrupt basis. For example, if data blocks are being 
read from the disk surface, reading of subsequent blocks ceas s during the error location and correction pro- 
55 cess. As was the case in the referenced Machado '321 patent, after the data stream is shut down, the disk drive 
microcontroller obtains the ECC remainder for the data block having the error. Based upon the remainder, the 
microcontroller then calculates values for locating and correcting the error. Erroneous data values are then 
replaced by connect d data values inserted in the data block in a buffer memory. After correction, the data block 
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is then available to be sent out to the host computer, and at that point the data stream may be permitted to 
resume its flow of data blocks. 

In summary, the prior approaches, whether bit-based or byte-based, perform three common functions: they 
read the incoming data and compare the recovered remainder values with zero. If there is an error, they shut 

5 down th data stream, then they may perform a retry by e.g. rereading the data and resending it through the 
syndrome generator in an attempt to eliminate non-repeating or "soft" errors. If that doesn't work, i.e.. a hard 
data error is present, they then perform an error correction operation. If the operation is determined to be suc- 
cessful, the data stream is thereafter reinitiated. If the error is determined to be uncorrectable, an error message 
is sent to the host computing system or communications system, and data transfer process is stopped indefi- 

10 nitely. 

There have been several prior efforts directed to performing very rapid error correction. One architecture, 
believed to be embodied in a special hardware chip having many thousands of gates, is described in U.S. Patent 
No, 4,782,490 to Tenengolts, the disclosure of which is incorporated by reference. While Tenengolts speaks 
of embodiments employing both hardware and firmware to implement high speed error correction techniques, 

15 no practical guidance is provided for implementation of on-the-fly enror correction. For example, in the Tenen- 
golts patent Fig. 8 "error trapping" embodiment, the same shift register used to detect errors, is also used after 
an error is detected for the enror trapping process, thereby removing the availability of the shift register for the 
next block of incoming data in real time. 

As used herein, the expression "on-the-fly" means an error correction process which is carried out with 

20 minimized data flow interruption, and which does not require one or more disk rotation latencies (revolutions) 
for canrying out the correction process. In order to perform ECC "on-the-fly", it is necessary to detect and correct 
the data errors, and to do so in a manner which does not stop the flow of data blocks during a typical transfer 
of multiple blocks. Conventionally, these two requirements have heretofore been conceptually lumped together. 
Actually, the present inventors have discovered that these two requirements are separate and separable 

25 aspects to the overall problem of error correction. 

A syndrome generator is typically provided within a hardware implementation for error correction. This syn- 
drome generator operates on-the-fly upon the data stream In order to generate an ECC syndrome for each pre- 
determined data block. Once the syndrome is generated, it is typically appended at the end of the data block 
during the record ing/ECC encoding process. 

30 As taught by the referenced Machado '321 patent, the syndrome generator circuitry may also be used as 

a remainder recovery circuit during data block readback from the storage surface. The syndrome remainder 
bytes regenerated during readback from the data block or subsets coming e.g. from the disk surface are compa- 
red to zero by a hardware comparator. If the remainder bytes are not zero, an error is present in the readback 
data block or interleave thereof. In the Machado '321 patent approach, the data stream is then stopped, and 

35 the non-zero remainder bytes held in the syndrome generator shift registers are then passed via a bus structure 
to the microcontroller. The microcontroller then executes finmware routines to locate and correct the error or 
errors. The prior Machado *321 patent approach, while employing the on-board digital microcontroller to carry 
out error correction processes, did not do so on-the-fly. Detection of any error caused data flow to stop. 

Another typical approach to perform on-the-fly error correction has been to provide one or more dedicated 

40 hardware processors whose task it is to calculate the correction of errors and to insert corrected data for erro- 
neous data into one or more appropriate positions of the data block undergoing correction. Typically associated 
with dedicated hardware processors is a dedicated FIFO or buffer memory, typically four kilobits in storage 
capacity (512 bytes by 8 bits per byte). 

An example of a hardware-based ECC architecture is given in the referenced Tenengolts U.S. Patent No. 

45 4.782.490. As best understood, Tenengolts provides a very general approach to error correction and cross 
checking in conjunction with Fig. 2, and no mention or discussion is provided concerning error correction on 
the fly. However, in conjunction with his Fig. 8 embodiment, the syndrome that detected the burst error is shifted 
through the original ECC recovery circuit's shift registers until the error is located. The number of serial shifts 
required until zero value residue bytes are encountered is counted. This count then provides the enror burst 

50 location, while residue bytes immediately following the zero byte residue bytes provide a basis for determining 
the corrected data values to be inserted in the data stream. While Tenengolts proposes clocking the ECC shift 
registers at a four-times clock rate to permit "a single burst correction to be performed in the record transfer 
time" (Tenengolts Column 15, lines 47-48). it is not clear how such a procedure would enable the next conti- 
guous block of data to be passed through the shift registers in real time (i.e. on-the-fly), unless considerable 

55 gaps existed between the data blocks, or a block transfer delay was actually programmed into the data stream 
with a FIFO block buffer, etc. While the Tenengolts Fig. 2 approach is stated to be very fast, it is also admittedly 
complex. The Fig. 8 approach is said to be slower, but sufficiently fast, even though the decoding process is 
said to be implemented in finmware. While a finmware implementation is mentioned, no details, such as a fir- 
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mware code listing, are given in the Tenengolts patent. The Tenengolts patent also describes cross checking 
and ID field (header) error detecting techniques within the constraints of the hardware implementation given 
in Fig. 2. for example. 

Another high speed ECC architecture, involving dedicated parallel processors operating in pipeline manner 

5 to avoid inter-step transfers is described in U.S. Patent No. 4.567,594 to Deodhar, the disclosure of which is 
also incorporated by reference. The technique described in Deodhar U.S. Patent No. divides the Reed-Solomon 
decoding process into a sequence of steps which are carried out by the pipeline dedicated processors with a 
minimum of inter-step parameter transfers. 

A more enlightened prior art approach to error correction makes use of the data block buffer associated 

10 with the data controller function. This approach uses a DMA channel in order to access and control the data 
block buffer. The ECC hardware generates a DMA request to the block buffer thereby to gain access to the 
erroneous data cells which are to be corrected. This approach recognizes the advantage of using the already 
existent data block buffer memory for the error correction process. This is the approach followed in the refe- 
renced, commonly assigned U.S. Patent No. 4.730,321 to Machado. However, this prior approach did not 

15 achieve "on-the-fly" error correction. 

In addition to the inability to perfonn single burst enror correction on-the-fly. another drawback of the 
approach described in the referenced Machado *321 patent arises from the probability of misdetection and 
consequent miscorrection of data in the double burst correction mode, essentially limiting its utility to single 
burst correction. While the occurrence of random errors within the correction capability of a particular Reed 

20 Solomon error correction algorithm will result In a zero misconrection probability, if the number of random errors 
exceeds that correction capability, the miscorrection probability becomes quite high, approximately .233. This 
means that almost one quarter of the errors statistically occurring above the correction capability will tfemis- 
detected and miscorrected. Known prior techniques which have been employed to reduce the misdetectidfei pro-.^ 
bability have included cross checks, such as including check sums within the data block. However, such cross> j&srsg 

25 checks have not been particulariy powerful, nor have they been able to detect shuffling or rearrangement-of ' ^ar^^ 
data values within the data block. ^ ^ - ? 

Summary of the Invention with Objects -^^ss^ c .r.i^ :^ 

30 A general object of the present invention is to provide "on the fly" error correction upon blocks of data^ithirt^t- 1 i^.-sf 

a sequence of data blocks by operatively combining a hardware-reduced ECC syndrome generator clrcuiiSnclins^ 
ding a remainder latch, and an embedded digital microcontroller and block buffer in a manner which overi^me^ ^v^-t^^- 
limitations and drawbacks of the prior art ® c. i . i i 

Another general object of the present invention is to realize a practical hardware-minimized implementation 
35 of "on-the-fly" error correction within a digital data system in which the correction processes are carriied out r 
substantially within drive-resident firmware executed as needed by an embedded microcontroller supervisor 
as a background operation and without interrupting the flow of data. ' 

Another general object of the present invention is to provide an ECC syndrome remainder recovery circuit 
which includes a latch for selectively latching ECC syndrome remainders not equalling zero, thereby enabling 
40 on-the-fly error correction to be carried out by a microcontroller and block buffer memory without stopping flow 
of data blocks within a digital data system. 

Another general object of the present invention is to provide a Reed Solomon cross check polynomial for 
use to check the correctablMty of error bursts in a data block by a Reed Solomon error correction polynomial, 
wherein both the cross check polynomial and the enror correction polynomial are based upon a Galois field in 
45 which a primitive element alpha' tenm is not equal to 02 (Hex). 

A more specific object of the present invention is to provide a Reed Solomon cross check polynomial 
employing a primitive element alpha' term having a 2B(HEX) value. 

A more specific object of the present invention is to provide an on-the-fly error correction system within a 
disk drive data storage subsystem including a supervisor microprocessor which determines error location and 
50 values within enror bursts detemnined to be present from latched non-zero syndrome remainder bytes as a back- 
ground operation, not usually requiring interruptton of flow of data blocks from the disk to a host computing sys- 
tem. 

Yet another more sp cific object of the present invention is to provide an enror correction method which 
includes on-the-fly cross checking in order to detenmine conrectability and thereby reduce the possibility of mis- 
55 correction of a detected error burst in a data block. 

A still further more specific object of the present invention is to provide a hardware-minimized error correc- 
tion/error detection syndrome generator apparatus which may be minimally modified in order to perform either 
two byte or three byte real time error detection for data block ID fields (headers), as well as recover remainders 
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for error correction of data blocks. 

One more specific object of the present invention is to provide a Reed Solomon based ID field error detec- 
tion method and apparatus which employs a Galois Field generator polynomial with non-consecutive roots in 
order to minimize hardware requirements without compromising Integrity of the error detection process. 

One more specific object of the present invention is to provide methods and structure for carrying out cross 
checking of the error conrection process while minimizing the amount of additional hardware required to perform 
cross checking. 

Yet a further specific object of the present invention is to provide a data block ID field error detection process 
which employs the same multiplier terms as are employed within an error correction process for the identified 
data block, thereby enabling use of common multiplier circuitry for both the ID field error detection and the data 
block error correction processes. 

Still one more specific object of the present invention is to provide an optimized, flexible hardware/finmware 
system which performs error correction and cross checking of data blocks and error detection of ID fields in 
real time and without data flow interruption, at least in the instance of single burst errors. 

Accordingly, in one aspect, the invention provides a Reed-Solomon enror correction apparatus having the 
features of claim 1. 

In another aspect, the invention provides a Reed-Solomon error conrection code syndrome generator appa- 
ratus having the features of claim 16. 

In yet another aspect, the invention provides an on-the-fly enror correction method having the features of 
claim 26. 

These and other objects, aspects, advantages and features of the present invention will be more fully under- 
stood and appreciated by those skilled in the art upon consideration of the following detailed description of a 
preferred embodiment, presented in conjunction with the accompanying drawings. 

Brief Description of the Drawings 

In the Drawings: 

Figure 1 is a schematic graph of a 51 2 byte data field for use with the methods and apparatus of the present 
invention wherein the data field is arranged into three interleaves and to which two cross check bytes, and 
twelve error correction bytes have been appended. 

Figure 2 is a plan view graph of a data surface of a rotating data storage disk, illustrating eight data zones 
of tracks 21 through Z8 each having a different data transfer rate and user data storage capacity, and commonly 
aligned servo sectors S extending throughout the data storage surface. 

Figure 3A is a schematic graph of a data ID field for a data sector as shown in the Figure 2 graph and 
employing a three byte error detection code. 

Fig. SB is a schematic graph of a data sector ID field not including count information, and therefore able 
to employ a less powerful two byte error detection code. 

Figs 4A&4B are a detailed block diagram of an ECC encoder/decoder and cross checking structure imple- 
menting the methods of the present invention. 

Fig. 5 is a series of timing diagrams illustrating operation of the Fig. 2 ECC structure. 

Figs 6A&6B are a block diagram showing the Fig. 2 ECC structure within a data sequencer element of a 
fixed disk drive. 

Fig. 7 is a block diagram of a fixed disk drive data storage subsystem including the Figs 6A&6B data sequen- 
cer and ECC structure implementing principles of the present invention. 

Detailed Description of a Preferred Embodiment 

While the present invention is effectively embodied in any system transferring blocks of data, whether sto- 
rage or communication, it finds particular utility and efficiency in the field of data storage and retrieval, such as 
within a fixed disk data storage subsystem. In such a system, to be described in greater detail hereinafter, data 
blocks are stored as data fields or data sectors within concentric data storage track locations of one or more 
data storage disks. As used herein, the terms data block, data field and data sector are used substantially inter- 
changeably. A data read/write transducer is selectively positioned over the track location of interest, and blocks 
are written to or read from data sector storage locations within the track location. It is the process of storing 
and retrieving blocks of data, for example, which gives rise to the need for error correction, cross checking, 
and ID field error detection techniques of the present invention. 

An exemplary data block or field found within a fixed disk data storage subsystem, for example, is set forth 
in Fig. 1 . There are 51 2 data bytes designated [1] through [51 2]. Bytes [51 3) and [514] comprise two cross check 
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bytes for cross checking the data bytes [1] through [512], and there are twelve ECC bytes [515] through [526] 
which are used for error correction of the data bytes and cross check bytes. As illustrated in the lower portion 
of Fig. 1, in this particular disk drive data block exannple there are three interleaves: interleave 1, interleave 2 
and interleave 3. Interleave 1 contains bytes [11], [4]. [7], [10], etc. Interleave 2 contains bytes [2], [5), [8], [111], 

5 etc. Interleave 3 contains bytes [3], [6], [9], [12], etc. Interleaves 1 and 2 each include a total of 1 71 data bytes, 
and interleave 3 contains 170 data bytes. As is apparent from Fig. 1, the two cross-check byte positions [513] 
and [514] are located in different interleaves, and there are four ECC bytes located in each interleave: bytes 
[517], [520], [523] and [526] in Interleave 1, bytes [515], [518], [521] and [524] in Interleave 2, and bytes [516], 
[519], [522], and [525] in Interleave 3. The use of within-sector interleaves is conventional in error correction 

10 schemes in order to extend the ECC capability to longer error burst lengths. Another reason is to spread a conti- 
guous block error burst across several Interleaves, thereby reducing the complexity of error conrection techni- 
ques, for example. 

In the presently preferred error correction example, a single error burst is defined as an error occurring in 
one byte within one of the interleaves. A double burst error is defined as an error occunring in two bytes within 
15 one of the interleaves. Given the three interleaves depicted in Fig. 1 , it is apparent that a single burst error may 
be up to three bytes in length, so long as each byte containing an error is located within a different one of the 
three interleaves. 

As was true with the single burst enror correction method described in the referenced Machado patent, the 
Galois field employed by the present error correction example is generated from the following polynomial: 

20 P(X) = X8 + + X3 + X2 + 1 (1) 

[Unless otherwise noted hereafter, the symbol designates a bit-by-bit exclusive-OR operation.] ^ 
The first element of the field was selected such that the number of gates needed to implement the multipliels 
is minimized: ^k** . ^ 

alpha^ = x5 + x3 + X + 1, (2) 
25 The model for the Reed-Solomon code implemented within the presently preferred embodiment is defined - 

by the following generator polynomial G(x) employing an exponent offset equal to 126: 
G(x) = x^ + alphai8.x3 + alphas^.xz + alpha^^.x + 1 (3) 
In factored form, equation (3) is: Jlt - -^t^ 

G(x) = (X +alphai26)(x + alpha i27)(x + alphai28)(x + alphai^^) (3a) < -M.. 

30 The twelve ECC bytes are generated upon passing all oftheuserdata bytes (and two check bytes) of eachr: - . .sr 

512 byte block through a syndrome generator 10 (shown in Fig. 4) on a serial byte-by-byte basis. The vai'Ce; r * 
C(x) is an encoded data polynomial written to the disk which is based upon the data and the cross-check byt^ - 
for each data block. The value D(x) is an encoded data polynomial based solely upon the data bytes of the ^^.-^ 
data block. The value C'(x) is the encoded data polynomial C(x) as read back from the disk surface. Thus. C'fx) > .'^ 

35 may contain one or more error bursts which must be detected and corrected, if conrectable. The value R(x) 
comprise the remainder which is related to the value C'(x). \% 
The encoded data codeword C(x) for each syndrome byte within each interleave is produced by the follo- 
wing equation: 

C(x) = [x4.D(x) mod G(x)] + x<.D(x) (4) 
40 During the readback process through the same syndrome generator 10, the remainder value R(x) for each 
interleave is recovered, and if not zero, is latched in a remainder latch 68 per the following: 

R(x) i. CXx) mod G(x) (Sa) 
45 - R3x3 + R2x2 + Rix + RO (5b) 

A programmed microcontroller 196 then decodes each partial syndrome byte (Si) of the code word for each 
interleave in accordance with the following: 

Si = R(x)nK>d(x alphas ^126) 
50 alphaci + 126) * 4 

(where i = 0, 1, 2 and 3 and where the exponential + Indicates an addition operation). If the remainder bytes 
(the coefficients of R(x)) have a non-zero value, there then exists at least one enror burst in the read-back data 
block. 

In the presently preferred embodiment given herein, the error correction proc ss above has the capability 
55 of correcting on-the-fly at least one enror burst for each one of the three interieaves.ln this, on-the-fly error 
correction process, the partial syndromes have the following forms: 

50 =e1.alphai26*Li (6) 

51 = e1.a!pha^27-Li (7) 
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52 = e1.aIpha''28-Li (S) 

53 = e1.alphai29-Li (9) 
The solutions for L1 (location) ande1 (single error value) are as follows: 

L1 = log[alpha»-i] = log[S1/S0] (10) 
5 el = S1.(S0/S1)127 (11) 

The system of the present invention also performs double-burst error correction in accordance with the fol- 
lowing partial syndromes: 

50 = e1-alpha^2e-Li + e2 alphai26-L2 (12) 

51 = e1alphai27-Li + e2alphai27*L2 (13) 
10 S2 = e1alphai28*Li + e2alphai28*L2 (14) 

S3 = e1-alphai29*Li + e2alphai29*L2 (15) 
In order to solve these equations, the following steps are performed. 
1. Values for phi 1 and phi 2 are found: 

_ S0 S3 + S1-S2 .^ex 

phi 1 = (16) 

IS S0S2 + SI2 ^ ' 

. - SrS3 4- S22 

P^'^ " S0-S2 ^ SI2 

There are several ways for solving the linear quadratic equation (18). One classic way is to implement a 
Chien Search loop which substitutes all Galois Field elements into the equation and detects for zero value. 
20 When a zero-value result is found, the loop stops and the input element is thereupon defined as XI .If no 

solution is found, an uncorrectable error event has been detected. Since equation (18) is a linear quadratic 
equation, 

phi 1 = x1 + x2 (18a) 

X2 may be found as follows: 
25 X2 = XI phi 1 (19) 

3. Locator values LI and L2 are then evaluated: 

L1 = log[x1] (20) 
L2 = log[x2] (21) 

4. Error values.el and e2 are then evaluated: 

30 ... S0X2 + S1 

®^ ■ X1i26 phi1 ^"^^^ 

e1 = QQ^^ ^ (23) 

X2126.phi1 

Given appropriate processing capabilities and speed at the microprocessor 196, and given a reasonable 
35 user confidence level in the correction process, it is practical to conrect double burst errors on-the-fly with the 
present methodology. 

In order to reduce the probability of miscorrection below .233 for multiple random errors, a generator poly- 
nomial Gxc(x) has been selected for performing a Reed Solomon cross check. This generator polynomial is of 
the fonm: 

40 G,c(x) = X2 + alphai (24) 

This cross check polynomial turns out to be a surprisingly powerful algorithm. It cross checks the error 
correction process and is sensitive to scrambling or misordering of the data values within the data block. This 
cross-check method is designed so that it may be carried out on-the-fly for the case at least of a single burst 
error per interieave, and with sufficient processing speed, for the case of a double burst error per interleave. 
45 If a particular error exists, a correct way of expressing the error is as a polynomial E(x) where: 

E(x) = epc" (25) 

In order to determine the effect of the error correction made to a data block on the corresponding cross 
check syndrome, alpha^ is substituted into the error polynomial (25) for x, and then the cross check syndrome 
(XC) is updated. Syndrome cross check (XC) = previous (XC) + ejalpha"-', wherein ejalpha*-' is computed by a 
50 programmed microcontroller 196 described he re in after. If the updated cross check bytes are not zero after the 
correction has been made to the data field, then the enror is determined to be uncorrectable by the present 
methodology. 

This approach provides a very rapid, on-the-fly method for verifying an error correction to determine that 
the present error burst is not within a predetermined miscorrection probability, such as .233, for the double burst 
55 correction case for example, and that an error correction operation performed on-the-fly in the single burst error 
scenario has not resulted in an erroneous correction or "miscorrection" of the data burst 

As noted it is desirable to use a predetermined alpha^ (a Galois field constant) of a value not equal to 2 in 
the cross check polynomial (s) above. By selecting a Reed-Solomon code with a predetermined beginning 
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offset, e.g. 126. a desirable alpha^ constant turns out to be 2B(HEX), or 0010101 1 (binary). In this particularly 
preferred embodiment, there are as many ones as there are zeros in the preferred alpha^ constant Thus, It 
has sufficient "mass" or "substance" to be surprisingly effective within the cross check polynomial, as it scram- 
bles itself very rapidly as it propagates through the ECC syndrome generator. 

5 Thus, in choosing the root alpha" for the cross check polynomial, the roots selected for the ECC polynomial 

are to be avoided. In the presently preferred embodiment, the ECC roots have been chosen to be n=126. n=127. 
n=128 and n=129. These values, being different than the cross check root n=1. simplify the firmware and speed 
up cross checking of single error burst confection per interleave on-the-fly. 

The data format for a disk drive in which the present invention is implemented employs a plurality, e.g. eight, 

10 radial data zones. Each zone includes plural concentric data tracks wherein each track has a predetermined 
number of data sectors. Each data sector contains a data ID field, followed by the Fig. 1 arrangement of 512 
data bytes, two cross check bytes, and twelve ECC bytes. The data ID field includes an address field followed 
by three EDC bytes. In order to maximize data storage, split, data fields or data sectors are employed. By "split 
data field"s meant that the servo sectors S extend as radial spokes on the data surface and interrupt or "split 

15 up" at least some of the data sectors into segments. Zoned data recording techniques of the presently contem- 
plated type employ either a constant disk angular velocity and a variable data transfer rate, or they employ a 
fixed data transfer rate and e.g. slow the disk speed as the data transducer moves from a radially outer zone 
to a radially inner zone. Presently, a switched data transfer rate Is preferred. Consequently, there are different 
data transfer rates for the different data zones, and there are different numbers of data sectors within the tracks 

20 of the different zones selected to optimize data storage capacity in the disk storage format. Also, as described 
in commonly assigned U.S. Patent No. 4.675.652 to Machado. entitled "Integrated Encoder Decoder for Varia- 
ble Length. Zero Run Length Limited Codes", the disclosure of which is hereby incorporated by reference, a 
1.7 RLL coding arrangement is preferably employed for data compaction on the storage surface of a data sto- 
rage disk 162. 

25 As shown in the example provided by the Fig. 2 layout diagram, the two opposite data storage surfaces of 

e.g., a two and one half inch diameter rotating data storage disk 1 62 may be defined to include a radially outer-- ^> 
most zone Z1 having e.g. 110 concentric data storage tracks per surface, each of which includes 60 data sectors 
and a raw data rate of 18.13 megabits per second (Mbps) (a coded data frequency of 27.20 MHz) for a totah 
storage capacity of 6.48 MBytes, for example. The next radially inward zone Z2 for example may contain 109^^$^ Tt. ^ 

30 data tracks, each of which includes 56 data sectors and has a raw data rate of 17.45 Mbps (a coded frequency 
of 26.18 MHz). Zone 2 stores 6.1 9 megabytes of user data on both sides of the disk 162. The third data zon^ 
Z3 contains 109 data tracks, each of which has 52 data sectors, and has a raw data rate of 15.69 Mbps (a coded 
data frequency of 23.53 MHz), and therefore stores 5.75 megabytes of user data on both sides of the disk 1 621 
The fourth data zone Z4 has 109 data tracks, each having 52 data sectors, and has a raw data rate of 15.89'-" - ^ 

35 Mbps (a coded data frequency of 23.53 MHz), and therefore stores 5.75 megabytes of user data on both sides - ^ ^ 
of the disk. The fifth data zone Z5 may have 109 data tracks, each having 47 data sectors, and has a raw data 
rate of 14.77 Mbps (a coded data frequency of 22.15 MHz), to achieve storage of 5.19 megabytes of user dafe 
on both sides of the disk 14. The sixth data zone Z6 has 108 data tracks, each having 44 sectors, and has a 
raw data rate of 14.00 Mbps (a coded data frequency of 21 .00 MHz), and achieves a capacity of 4.81 megabytes 

40 of user data on both sides of the disk 162. The seventh data zone Z7 has 108 data tracks, each having 41 
sectors and use a raw data rate of 1 2.95 Mbps (a coded frequency of 1 9.43 MHz) so as to store 4.48 megabytes 
on both sides of the disk 162. The eighth data zone Z8 may include 1 08 data tracks, each having 39 data sectors 
for example, and has a raw data rate of 12:09 Mbps (a coded frequency of 18:1 3-MHz) in order to store 4:26 
megabytes on both sides of the disk 162. A radially outenmost region SYS of the data surface 12, e.g. comprising 

45 14 concentric data tracks following the Zone 8 format, may contain system information needed for operation 
of the disk drive subsystem 100. Several tracks inside of the eighth data zone, and having the bit transfer cha- 
racteristics of the eighth data zone may be made available for storage of system diagnostics and other values. 

White these eight data zones are described as having the characteristics given above, it is to be understood 
that the number of zones, the number of tracks and sectors per zone, and the data transfer rates within each 

50 zone, may vary from the examples given, as is well understood by those skilled in the art. depending upon head 
and media design and quality, head position servo system, and desired storage capacity.ln the present exanv 
ple, a data storage capacity of 42.91 Megabytes is achieved with a single 2 1/2 inch diameter thin-film media 
data storage disk 162 and with thin film data transducer heads 170a and 170b. 

The data transfer frequencies employed within each zone are preferably synthesized e.g. by a frequency 

55 synthesizer 190 operating under the control of the system microcontroller 196. The synthesizer is therefore set 
to synthesize and put out to a PLL 188 the coded frequency for each data zone. 

What is to be considered important from the Fig. 1 example is the inclusion of count bytes within each data 
sector ID field, which enables a data sequencer 1 00 to determine immediately the structural layout of the imme- 
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diately following data field of the sector. This layout may or may not comprise a split data field and include one 
or more interrupting servo sectors, as shown in Fig. 2. 

Fig. 2 also illustrates diagrammatically a plurality of unifonmly spaced apart servo sectors S (e.g. 52 servo 
sectors) which are uniformly disposed about the data surface of a storage disk 162 and are embedded between 

5 or actually interrupt the data sectors in each zone. Since these servo sectors S "intenrupt" at least some of the 
data sectors, following every servo sector intenuption there is a special data sync field to enable the drive to 
resynchronize to the data rate. The data field count information provided in the data block ID field is used by 
the data sequencer 100 during the data writing and readback process to detenmlne the layout (i.e. number of 
and locations of servo sector intemjptions and the length of each resultant data field segment) of the data field 

10 of interest. Since the byte position counts may vary from data ID field to data ID field, it is very important to 
confirm the accuracy of the information recovered from the ID field in real time. The present invention advan- 
tageously employs the same syndrome generator circuit 10 (Figs 4A&4B) in order to perform Reed Solomon 
error detection process in real time for each one of the data ID fields read by a selected head 170. 

Fig. 3A illustrates a presently preferred data block ID field anrangement particularly well suited for split data 

15 field formats. Each ID field includes a nine byte preamble, an ID field address mark byte, three data field count 
bytes: count 3, count 2 and count 1; a sector identifier byte; and a head identifier byte. There are then three 
error detection code bytes: EDC byte 1 , EDC byte 2. and EDC byte 3, followed by a two byte pad. a write splice 
gap (gap 2) byte, and the data field, which ends at a servo sector S location. The values present within the 
three count bytes determine respectively the lengths of up to three segments of the data field as may be interrup- 

20 ted by servo sectors. For example, if there are no interruptions of the data field, count byte 1 would indicate a 
full field segment length of 512 bytes for example. If there are two servo sector interruptions of the data field, 
as is the case with some data blocks, count byte 1 would set forth the byte length of the first data field segment, 
count byte 2 sets forth the byte length of the second data field segment, and count byte 3 sets forth the length 
of the third data field segment, for example. 

25 With the ID fields used in the preferred zoned data recording scheme containing byte count values, it beco- 

mes very important to provide an enror detection method which will check the validity of the infonmation read 
back from each ID field. A three byte code is presently preferred in this application having split data fields with 
embedded servo sectors S interrupting at least some of the data blocks. An important aspect of the present 
invention is to provide an error detecting coding arrangement which makes use of the Figs 4A&4B syndrome 

30 generator. Such an arrangement has been discovered. The following presently most preferred code will gene- 
rate three enror detection check bytes which are appended to each data ID field: 

Gedc3(x) = x3 + ax2 + bx + c (26) 
In presently preferred form, this error detection code may be expressed as follows: 

Gedc3(x) = x3 + alpha87.x2 + alpha^^-x + 1 (26a) 

35 In factored fonnn, equation 26a is rewritten: 

Gedc3(x) = (x + alphaO)(x + alphaio8)(x + alpha^*/) (26b) 
It is apparent by inspection that these alpha coefficients are not consecutive. This code will generate three enror 
detection bytes. Since the syndrome generator depicted in Figs 4A&4B already has the alphas^ multiplier logic, 
that logic is available for use in the enror detection process. 

40 For drives not employing embedded servo sectors which intenrupt data fields of data sectors (and in which 

the data ID sectors do not contain byte counts) a two byte enror detection code has been determined to be suf- 
ficient One example of an ID field employing a two byte error detection code is depicted in Fig. 3B. In the exanv 
pie given in Fig. 3B. the ID field includes a nine byte preamble, an ID field address mark byte, a cylinder high 
byte, a cylinder low byte, a head byte, a sector byte, and two error detection code (EDC) bytes: EDC 1 and 

45 EDC 2. These EDC bytes are followed by a two byte pad. and by a gap 2 byte before reaching the data field 
which continues uninterrupted until the next data sector, which may be preceded by an embedded servo sector 
S. 

A presently most preferred two byte EDC code may be generated by the following generalized relationship: 

Gedc2(x) = x2 + ax + b (27) 
so where b is unequal to one. A more specific form of this relationship is: 

Gedc2(x) = x2 + x + alpha^s (27a) 

In factored fonm, equation (27a) is: 

Gedc2(x) = (X + alpha22)(x + alpha25i) (27b) 
With this code, only one multiplier circuit is required: alpha i®. which circuit is already present in the Fig. 4 
55 ECC syndrome generator circuit. 

Other EDC codes are known to be available for detecting enrors within the ID field of a data block. However, 
each of the following algorithms require an additional multiplier tenm. and therefore are slightly more compli- 
cated to implement than the algorithms (26) and (27) given above. 
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A presentiy less preferred three byte EDC code may be generated by the following relationship: 

* ■ 9 — ' r-- 

GEDC3W - x3 + alpha18.x2 + alpha^^.x + alpha^S (28) 



- (X + alpha5)(x + alpha6)(x + alpha^) (28'a) 

A presently less prefenred two byte EDC code may be generated by the following relationship: 

GeDC2(x) - x2 + alpha81*x + alphaSI. (29) 
« (x + alpha^05)(x + alpha231) (29a) 



15 With reference to Figs 4A&4B, an ECC syndrome generator circuit 10 simultaneously implements circuitry 

for performing the ECC function for the data fields, the cross checking function for the data fields, and the EDC 
function for the data block ID fields. These multiple functions carried out within a single, simplified circuit 10 
fecilitate on-the-fly error correction in accordance with the present invention. The Figs 4A&4B ECC circuit 10 
includes an incoming serial data bus 12 carrying serial-by-byte data. In accordance with the output state of a 

20 logic gate 13, a data extender latch 14 passes the serial data on the bus 12 onto an internal ECC bus 16. The 
bus 16 leads to a byte-wide exclusive OR (herein XOR) gate array 18 wherein each incoming byte Is exclu;^ 
sive-ORed (herein XORed) with a byte exiting the ECC portion of the circuit 1 0 on an outflow bus 63. The result ? ^ 
tant sum is passed by the gate 18 onto a bus 20 which leads through a zeroer circuit 22 to another bus 24^ ^ 
The zeroer circuit 22 is used to force bus 20 to zero when appending ECC or EDC bytes to data frelds, such 

25 as during data writing to disk operations or disk format operations, for example. : -r-s-m 

The bus 24 leads into a three byte register 26, the output of which leads to an XOR gate array 28. The buS$?E:/ f . -sn 
24 also leads to a bi-coefftcient multiplier which generates and puts out an alpha^^ product on a bus 32 an^^k^ ^ 
which also generates and puts out an alpha®^ product on a bus 34. The alpha^s product is XORed with the outpute^*: . . cm? 
from the shift register 26 in the byte-wide exclusive OR gate array 28, with the resultant sum then passe^mx:^ ^ m$ 

30 through another three byte shift register 36. The alpha^^ product on the bus 34 is then XORed with the outpuissEr . .^ ; 

from the register 36 in another byte-wide XOR gate array 38. An output sum of the XOR array 38 passes througfc^^ ♦ j^- 
another three byte shift register 40 and. the alpha^^ product on the bus 32 is XORed with the output from the 
shift register 40 and yields an output sum on a bus 43. "-'''-^ - * 

The output sum on the bus 43 is then selectively passed through a multiplexer 44 to a single byte register ^ 

35 46. The multiplexer 44 operates in conjunction with the EDC process, explained hereinafter. An output from -r 
the single byte register 46 is then XORed with the alpha^^ product on a bus 54. In this regard, the alpha®^ product 
on the bus 34 passes through a multiplexer 50 and a zeroer circuit 52 to reach the bus 54. An output on a bus- 
55 from an XOR gate array 48 then passes through a multiplexer 56 to reach a second single byte shift register 
58. The multiplexers 50 and 56 are used to select between ID fields having either two EDC bytes, or three EDC 

40 bytes. 

An output of the register 58 Is then XORed with the alpha^^ coefficient on the bus 54 in an XOR gate anray 
60. A resultant sum put out by the XOR array 60 is then passed into and through a single byte shift register 62 
which leads directly to ttie output bus 63. It is apparent that the last register group comprising registers 46, 58 
and 62 are split up by the XOR gate anrays 48 and 60 so that the EDC function can be carried out on either 
45 two or three EDC bytes. In this manner a Reed-Solomon error detection code of either two bytes or three bytes 
for checking data sector ID fields may readily be realized and implemented within the existing ECC syndrome 
generator 10 with minimal additional circuitry. 

During nonmal ECC operations, the zeroer circuit 52 presents zero coefficients, in lieu of either an incoming 
byte value (two byte EDC) or the alpha^^ product (three byte EDC). Thus, during nonmal ECC operations, the 
50 XOR gate arrays 48 and 60 are functionally inoperative in the paths between the registers 46, 58 and 62. The 
multiplexer 56 places the alpha^^ product into the register 58 during a two byte EDC operation. 

The output bus 63 from the last register 62 leads through a multiplexer 64 to an ECC output bus 66 which 
enables the circuit 10 to generate and clock out in proper time twelve ECC syndrom bytes which are appended 
into the ECC check byte positions of each data field, as depicted in Fig. 1. Two control signals: a byte dock 
55 signal BYTCLKA, and an ECC enable operation signal ECCENOP are combined in the AND gate 13. and its 
output then provides a master clock signal for clocking bytes progressively through the latch 14, and the shift 
registers 26, 36, 40, 46, 58 and 62. The shift registers 26, 36, 40, 46, 58 and 62 are initialized with predetermined 
initial values by assertion of an initialization control signal CTLIEC. The values may be arbitrarily selected, but 
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are repeated at each initialization of the shift registers. 

As noted hereinabove a syndrome remainder latch 68 is provided within the syndrome generator circuit 
10. This latch 68 receives and stores a number of remainder bytes. Three remainder bytes are received from 
the register 26 including a first byte over a bus 70. Three more remainder bytes are received from the shift regis- 

5 ter 36 including a fourth byte over a bus 72. Three more remainder bytes are received from the shift register 
40 including a seventh byte over a bus 74. A tenth remainder byte is received from the single byte shift register 
46 over a bus 76; and an eleventh remainder byte is received from the shift register 58 over a bus 77; while a 
twelfth remainder byte is received over a bus 79. 

The remainder latch 68 is controlled by three control signals. First, a five bit control signal LADD[4:0] compri- 

10 ses the five least significant address bits of the microcontroller cycle currently in operation. These five address 
bits are used by the microcontroller in order to fetch a particular latched remainder byte for processing by the 
microcontroller 196 during an on-the-fly ECC operation. Second, a syndrome latch dose control SYNLCLS 
signal is used to operate the remainder latch 68 to latch a particular (e.g. non-zero) remainder for on-the-fly 
ECC operations. The remainder latch 68 is normally in the transparent mode. Upon detection of an error as at 

15 the output line 80, the data sequencer 1 00 asserts this signal asynchronously to latch the remainder bytes from 
the shift registers 26, 36, 40, 46. 58 and 62. This operation is carried out before a reinitialization which occurs 
for the next data sector. Thus, the twelve byte ECC syndrome remainder provides the information from which 
the enror location(s) and error value(s) can be detenmined by the microcontroller 196 and corrected while the 
next data sector is being circulated through the syndrome generator circuit 10 and its remainder bytes are being 

20 recovered. Third, a syndrome latch open control SYNLOPE signal must be the opposite of the syndrome latch 
close control signal. This SYNLOPE signal is true when the syndrome remainder latch 68 is in its transparent 
mode, and low when the syndrome latch 68 is latched. 

The byte values on the buses 70, 72, 74, 76, and 77 are passed to the zero detect circuit 78 and checked 
to see if they are zero. Since syndrome bytes progressively clock through the shift register groups 26, 36 40 

25 and the last group of individual shift registers 46, 58 and 62 in three interieaves, only those bytes from the first 
one of each of the four groups of syndrome registers need to be input to the zero detect circuitry for ECC syn- 
drome zero checking. This arrangement saves logic elements. A parity with zero during data read operations 
indicates no errors have been detected within the present data block. 

Five control values enter the zero detect circuit 78: the ECC enable operation control ECCENOP; an enror 

30 detection operation control CRCOP; an error correction operation control ECCOP; the byte clock signal BYT- 
CLKA, and a no cross check operation control NOXC. These control signals progressively enable the zero 
detect circuit 78 to determine whether a particular ECC syndrome, cross check syndrome or EDC syndrome 
is zero, thereby facilitating on-the-fly correction. 

A single bit remainder output control value is presented on a line 80. This line will be at one logical state 

35 when the remainder bytes being checked are equal to zero, and it will be at another logical state when there 
is no parity with zero, as determined by the zero detect circuit 78. An on-the-fly service routine is called by the 
microcontroller 1 96. and as part of the execution thereof, the syndrome byte values actually latched by the syn- 
drome latch 68 are put out over a syndrome data bus 82 to the microcontroller 196 of the disk file whenever 
the single bit syndrome value on the line 80 indicates non-zero parity. Such a condition could arise from 

40 checking the twelve ECC bytes appended onto each data field. Another error condition microcontroller inter- 
vention could arise from checking either a three byte ID check field, or a two byte ID check field, during an error 
detection code (EDC) operation, for example. 

Once the microcontroller 196 locates and corrects an error burst in a sector on-the-fly, the possibility of a 
miscorrection having occurred is then tested by having the microcontroller 196 detemnine the impact of the 

45 correction of data upon the cross check remainder bytes which are also held in the remainder latch 68 and read 
by the microcontroller 196 as needed. 

In order to obtain the cross check remainder bytes, the data block on the ECC data bus 16 is passed through 
an exclusive OR gate 84. An output from the gate 84 is passed through a multiplier 86 which calculates and 
puts out an alpha^ product on a bus 88, The alpha^^ product is then latched into two byte cross check shift regis- 

50 ters 90a and 90b. An output from the shift register 90b feeds back to the XOR gate array 84 and also provides 
another input to the multiplexer 64, so that the cross check bytes can be appended to the data field as shown 
in Fig. 1. 

An output bus 94 from the register 90a enables the first cross check byte to be latched into the syndrome 
latch 68 and also to be checked by the zero detect circuit 78. Similariy, an output bus 96 from the register 90b 
55 enables the second cross check byte to be latched into the remainder latch 68 as well as checked for zero parity 
by the zero detect circuit 78. 

When an ECC remainder byte value is determined not to be zero and the twelve ECC remainder bytes are 
latched into the remainder latch, the two cross check remainder bytes then present in the registers 90a and 
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90b are also latched into the remainder latch 68, The cross check remainder bytes are then converted into par- 
tial syndrome bytes XC1 and XC2. After an ECC operation following calculation of an error burst correction by 
the microcontroller 196, it then calculates the impact of the correction upon the partial syndrome bytes XC1 
and XC2. If those bytes, after conrection of the data error, are not zero, an error which cannot be corrected by 

5 on-the-fly error correction is determined to be present, the data stream is stopped, and retries may then be han- 
dled in accordance with conventional ECC procedures, invoked in accordance with e.g. the SCSI interface 
convention, as one example. Ultimately, if the cross check remainder bytes are not zero after a correction, the 
error is determined to be uncorrectable, and the disk drive stops and informs the host equipment that an 
unconrectable enror has occurred. If the cross check bytes are zero after error correction, a possibility of a mis- 

10 conrection of data is thereby detenmined to be significantly reduced. 

The cross check shift registers 90a and 90b are initialized by the initialization control signal CTLIEC which 
sets them to a known state, and are clocked by the byte clock signal BYTCLKA and by a signal indicating that 
ECC and cross check operations are being performed (instead of an ID field error detection operation), as deter- 
mined by a control gate 89. 

15 The following description summarizes the various control and data signals whose labels appear in Figs 

4A&4B. Timing for these signals is provided in the timing graphs set forth in Fig. 5. 

SYNLDAT[7:0] Syndrome Latched Data: A byte of syndrome data is read out from the selected latched 
syndrome byte position as determined by the address on the five bit latched addres (LADD) bus. This byte is 
passed over a microcontroller address/data bus during a microcontroller read operation. This data is strobed 
20 by a RSECC control signal. 

LADD[4:0] Latched Address Bus: The five least significant address bits of the microcontroller address cycle.*^ 
currently in operation are input to address a particular one of the byte registers of the syndrome latch 68. These*::^- 
five address bits are used to determine which latched syndrome byte will be read out to the microcontroller 1 96^ 



during a microcontroller read operation. /r-^ 
25 SYNLCLS Syndrome Latch Close: This signal is used to close the syndrome latch 68 for on-the-fly errors. 

correction. The syndrome latch 68 is normally in the transparent mode. On detection of an error in the ECC^. : -s 
syndrome, a data sequencer circuit 100 (Figs 6A&6B) asserts this signal asynchronously to latch the data fromfe ^ 
the syndrome generator shift registers into the syndrome latch 68 so that the syndrome data indicative of the^. r , : s 
error will not be lost This assertion by the data sequencer 100 is done prior to a reinitialization that occurs for^„ - 
30 the next data sector via the initialization control signal CTLIEC. Thus, the syndrome containing the encoded^^ : . . 
error location can be decoded by the microcontroller 196, and corrected while the next sector is being read^ 
and passed through the syndrome shift register sets 26. 36. 40 and the last group of registers 44. 58 and 62.^, c tfc 

SYNLOPE Syndrome Latch Open: This control signal must be the opposite of the Syndrome Latch Close-r 
control signal. It is true when the syndrome latch 68 is in its transparent mode (latch output = syndrome) andsr 



35 false when the syndrome latch 68 is latched and static. ^: 
ECCCRCN ECC/CRC Control: This signal indicates what type of syndrome the ECC generator 10 is to cal^-t 
culate. When true, this control line indicates that ECC and a cross check are to be computed on data bytes.^ 
When false, this line indicates that EDO is to be computed on the bytes of an ID field. 

CTLIEC Control Interrupt ECC: When true, this control line asynchronously sets all syndrome bytes of both 

40 the ECC generator and the cross check generator to a predefined initialization value, which may be an invariant, 
yet arbitrary pattern of ones and zeros. 

WRGATE Write Gate: This control signal is used while either CRCOP (error detection of ID field) or ECCOP 
(ECC opefatidn) af6 assisrted titje in order to Slop the syndrome generatorfrom continuing to calculate the syn- 
drome while it is being clocked out by the byte dock BYTCLKA. 

45 CRC2B EDC Two Byte Format: When asserted true, this control signal selects a two byte format for the 

EDC on the data ID field. In this case, the same syndrome generator is used for two byte EDC. When deas- 
serted. this line selects a three byte EDC syndrome process. 

SYNDO Remainder Equals Zero: This control signal put out by the zero detect circuit 78 is used for detecting 
errors. On two byte EDC formats, this signal will be asserted following the rising edges of BTYCLKA if both 

50 EDC remainder bytes are equal to zero. With three byte EDC formats and during ECC operations, this control 
signal will indicate a zero EDC remainder byte condition if, following the final three rising edges of the byte dock 
BYTCLKA, this signal is asserted true. That is to say, if. on the last two rising edges of BYTCLKA, ECCENAB 
is asserted true, and on the rising edg after BYTCLKA after ECCENAB deasserts this signal is true, then the 
ECC remainder bytes are equal to zero. 

55 SERDOUTt7:1] Serial/Deserial Data Out; This bus 12 carries in data bytes to the ECC circuitry 10 upon 

which a syndrome is to be computed. Data must be valid on the rising edge of BYTCLKA. On data sequencer 
read operations. ECCIN data comes from the serializer/deserializer SERDES 106, as more fully explained in 
the previously referenced, commonly assigned U.S. Patent No. 4,675.652 to Machado. All of the data bytes. 
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cross check bytes and ECC bytes are clocked in synchronism with BYTCLKA. On data sequencer write ope- 
rations, ECCIN data comes from the FIFO buffer memory; all data bytes are clocked In synchronism with BYT- 
CLKA. and the cross check bytes and the syndrome bytes are calculated and appended to the data stream. 
Following the final data byte, ECCIN must be connected to ECCOUT, so that the ECC syndrome may be cal- 
5 culated on the cross check bytes. 

ECCDOUT[7:0] ECC Data Out: This bus 66 caries data out of the ECC circuit 10. Data is valid on the rising 
edge of BYTCLKA. ECC syndrome data is available at this port In byte serial format while WRGATE and ECCOP 
control lines are asserted true. EDC syndrome data Is available while WRGATE and CRCOP control lines are 
asserted true. 

10 BYTCLKA Byte Clock A: This signal represent the basic clocking signal for all ECC and EDC operations. 

ECCENOP ECC Enable Operations: This control line indicates which BYTCLKA edges contain data for 
the ECC circuit 1 0. The BYTCLKA signal is gated internally with this signal In the AND gate 1 3 so that any tran- 
sient errors will not occur while BYTCLKA is true. 

CRCOP EDC and Cross Check Control: This control line indicates which bytes are EDC bytes, and which 
15 bytes are cross check bytes. This signal must be asserted true on the rising edge of BYTCLKA on which an 
EDC or cross check byte will be clocked into the ECC generator block 10. 

ECCOP ECC Operation Control: This control line indicates which bytes are ECC bytes. This signal must 
be asserted true on the rising edge of BYTCLKA during which an ECC byte will be clocked into the ECC syn- 
drome generator 1 0. 

20 With reference to the timing diagrams of Fig. 5, the upper group of timing signals along a common horizontal 

time base represent the various logical conditions of the listed control signals during an EDC check upon an 
ID field. The tower group of timing signals in Fig. 5 represent the various logical conditions of the listed control 
signals during an ECC and cross check operation. 

Turning now to Figs 6A&6B. It is to be seen that the ECC syndrome generator circuit 1 0 is but one functional 

25 element within the data sequencer 100 of an overall disk drive architecture 160, as shown in Fig. 7 and dis- 
cussed in greater detail hereinafter. The data sequencer 100 includes a data field address mark detector 102 
which receives incoming data directly from a pulse detector. The detector 102 looks for a sequence of high 
frequency flux transitions, and when such Is detected, the PLL 188 is then locked to the sequence and the reco- 
vered digital run length encoded data is checked for the presence of the address mark sequence. In this regard, 

30 the address mark detector 102 monitors the data stream in order to detect a unique bit sequence which is not 
consistent with the 1 J encoding rules and which is predetermined to represent an address mark. The address 
mark detector 102 generates the byte clock signal BYTCLKA from the raw data stream as well as an address 
mark found control signal whenever a bit sequence representing an address mark is found in the raw data 
stream. 

35 A 1,7 run length limited encoder/decoder 104 encodes and decodes serial data into and from a 1,7 run 

length limited (RLL) code, and the serlalizer/deserializer(SERDES) 106 bundles and unbundles data bytes into 
and from serial bIt-by-bit fomriat. The encoder/decoder 104 and SERDES 106 are substantially as described 
in the referenced Machado U.S. Patent No. 4.675,652 noted above. A FIFO byte register 108 enables data 
bytes to be asynchronously transferred between the sequencer 100 and an external cache buffer memory array 

40 214. clocked by an external crystal clock standard (as opposed to the BYTCLKA which is synchronized with 
the raw data stream read back from the disk). A multiplexer 110 regulates bidirectional data flow through the 
serializer/deserializer 106 and encoder/decoder 1 04 so that ECC syndrome bytes generated by the ECC gene- 
rator 10 may be appended to data blocks flowing to the storage surface, and so that data values present on a 
writeable control store (WCS) bus 134 may also be sent to the disk for storage. 

45 Reference data sector (i.e. physical sector and transducer head) identification bytes read from data ID fields 

are passed through a comparison multiplexer 112 to a comparison circuit 114. The comparison circuit 114 
compares actual data sector Identification bytes received from the SERDES 106 with the reference identifica- 
tion bytes held in a sector counter 116. If a correspondence exists, the desired sector location has been rea- 
ched, and a Compare = 0 control signal is put out by the comparison circuit 114 to a jump control multiplexer 

50 circuit 132. 

A writable control store (WCS) 116 provides the dual function of storing control words which control all of 
the operational states of the sequencer 110, and which may be loaded with Information directly read and written 
by the microcontroller 196 at locations thereof controlled by addresses decoded by a microcontroller address 
decoder 118. During one half of the BYTCLKA clock cycle, a multiplexer 120 enables direct access by the mlcro- 
55 controller 196 to the control store 116. 

During the other half of the BYTCLKA cycle, addresses from a sequence controller 122 are used to address 
the control store memory area 116. The controller 122 includes a sequence control decoder block 124 which 
enables the controller 122 to jump to a plurality of predetermined states, a sequence address multiplexer which 
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selects between various addresses, a last address register 1 28 for holding the last sequencer address for appli- 
cation to the control store 1 16 via the multiplexer 120, and a writable control store multiplexer 130 which selec- 
tively feeds back the last address held in the register 128 to the sequence address multiplexer 126. 

The sequence controller 122 is directly controlled by a jump control multiplexer 1 32 which generates a jump 

5 control signal from a plurality of logical inputs as indicated in Figs 6A&6B. A 28 bit wide writable control store 
(WCS) data bus 1 34 directly communicates with the writable control store memory 116 and enables the values 
held therein to circulate throughout the sequencer 100 along the paths shown in Figs 6A&6B. An opcode bus 
136 leads to an opcode decoder 138 which decodes each five bit op code into a plurality of logical conditions 
on the control lines shown coming out of the opcode decoder 138. The ECC/CRC SEL line, the ECCOP line, 

10 and the CRCOP line directly connect to the ECC syndrome generator 10, as previously explained. A PUSH 
SEL line extends to a Push multiplexer 140 which enables e.g. data field count bytes C3, C2 and CI to be pus- 
hed directly onto the top of a four byte register stack 142. A top of stack (TOS) bus and a next of stack (NOS) 
bus connect the stack 142 to a byte sequence counter 146 via a multiplexer 144 which also has the ability to 
load the sequence counter 146 with "1" values. The byte sequence counter 146 maintains the present byte 

15 count (remaining field length in bytes within a block) within the sequencer 1 00. When the presently loaded byte 
count increments to zero, the end of a particular field is reached, and the sequence counter 146 puts out a 
SCNT=0 value to the jump control multiplexer 132, so that a next state may then be invoked. 

A control decoder 148 receives primary control bytes, secondary control bytes, and count select bytes from 
the writable control store 116 and decodes these values into specific logical control values which are put out 

20 over the control lines shown coming out of the decoder 148 in Figs 6A&6B. including the write gate signal 
WRGATE and an initialize ECC signal lECC which directly control the ECC syndrome generator 10, 

A loop counter 1 50 is preset with a number of loops to be made during a particular data block transfer tran-^- 
saction (each loop nominally represents the states required to transfer a data block), and generates a 
LOOPCNT=0 control value when the count reaches zero. This control value signifying that the required number 

25 of data blocks has been transferred is also provided to the jump control multiplexer 132. An index timeout coun-^ 

ter 152 keeps track of the beginning of each track by generating an index timeout value INXCNT=0 which is^^. > 
used to control the sequence controller 124. A once per revolution raw index signal stored in the first one ofi^o 
the servo sectors S is detected by the servo control circuit 180 and used to clock the index counter 152. Other^r f% 
inputs to the sequence controller are the jump value from the jump control multiplexer 132, the address mark / ^ 

30 found value AMFOUND from the address mark detector 102, and the byte sequence counter SCNT=0 value . 

from the sequence counter 146. ^ -.-^ . 

Referring to Fig. 7, a fixed disk drive data storage subsystem 1 60 includes at least one rotating data storage.^ 
disk 162 which may be a 3,5 inch diameter, or more preferably a 2.5 inch diameter storage disk, for example.^ * ^ 

(although this invention will work to advantage in conjunction with any practical disk diameter, whether larger 

35 or smaller) and which is rotated at a substantially constant angular velocity, such as 3600 RPM, by a spindle 
motor 164. such as a direct drive brushless DC spindle motor which is formed in conjunction with a disk spindle^ 
hub to which the disk 162 is secured. The motor 164 Is driven by a motor driver circuit 1 66 which is controlled^ - 
by a motor control circuit 1 68 contained within a VLSI circuit chip also containing the sequencer 1 00, the ENDEC 
104. a servo controller 180 and PWM output 178, a microcontroller interface 202 and a buffer memory controller 

40 210. 

The motor control circuit 168 is directly responsive to the servo control circuit 180 which supplies a raw 
index pulse read from e.g. the first one of the plural servo sectors S. The raw index pulse is timed against the 
reference frequency, and the servo control circuit 180 delivers speed up or slow down commands to the motor 
driver 166, as may be indicated from time to time. 

45 A plurality of e.g. thin film data transducer heads 1 70a and 1 70b are respectively associated with opposite 

major data storage surfaces of the at least one data storage disk 1 62. The data transducer heads are preferably, 
although not necessarily, mounted to in-line aligned load beams which in turn are attached to vertically aligned 
anris of an arm assembly 171 of a mass balanced rotary voice coil actuator 172. The heads 170a and 170b 
operate conventionally in a contact-start-stop relationship with respect to the data surface, and they "fly" above 

50 the surface during operations upon an air bearing as is conventional with Winchester fixed disk technology, for 
example. 

A coil of the actuator 172 is driven by a servo driver circuit 174 which develops bidirectional driving current 
in response from low pass filtered width-modulated pulses received from a filter 1 76. Th filter 1 76 is connected 
to the servo pulse width modulator 178 within the servo control circuit 180. Servo pulses which are shaped by 
55 a pulse detector 184 and peak detected by a peak detector 192 are converted to digital values in an analog to 
digital converter 194 and delivered directly to the microcontroller 196. Coarse head position information read 
by the head 170 from the embedded servo sectors S is directly decoded by the servo control circuit 180 in a 
manner which is independent of the PLL 188, ENDEC 104 and sequencer 100. The servo control circuit 180 
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is implemented with a master state machine which establishes timing for the various fields comprising each 
servo sector, including a track identification field which is decoded by a data reader of the circuit 180 and fine 
position offset signals. The amplitudes of the fine position offset signals are sampled and held in the peak detect 
circuit 1 92 and processed by the microcontroller 196 during a servo phase of its time shared operations. Further 
5 details of the servo arrangement presently preferred for the drive 1 60 are to be found in a commonly assigned 
U.S. Patent Application Serial No. 07/569,065, filed on August 17, 1990, the disclosure of which is hereby incor- 
porated by reference. 

Each data head 170a or 170b, for example, is selected by head select circuitry contained with In a pream- 
plifier/select/write driver circuit 182, such as a type SSI 32R4610R 2 or 4 channel thin-film head read/write 

10 device made by Silicon Systems. Inc, Tustin, Califomia. or equivalent. The analog flux transition signals trans- 
duced by the selected head are preamplified in a preamplifier portion of the circuit 1 82 which is preferably moun- 
ted to a thin, flexible plastic substrate (Mylar^) carrying multiple conductive traces within a henmetically sealed 
head and disk assembly,and in close proximity to the data transducer heads 170a and 170b in order to increase 
the signal to noise ratios of the analog signals recovered from the storage surfaces. 

15 The recovered analog data is then passed through a pulse detector 184 which transfomis the analog flux 

transitions into digital logic edges as is conventional. A write precompensation circuit 186 precompensates data 
to be written to the disk received firom the sequencer 100. The precompensated write data is supplied directly 
to the head select/preamplifier circuit 182 for delivery to the selected head transducer. A phase locked loop 
circuit 188 operates to separate the digital edges into data bit pairs (encoded data) and delivers those data bit 

20 pairs to the encoder/decoder 104 which is illustrated in Fig. 5 as being a part of the sequencer 100. A frequency 
synthesizer 1 90 enables a variety of reference frequencies to be synthesized in order to provide for zoned bit 
recording as described hereinabove. The synthesizer 190 operates directly under the control of a programmed 
microcontroller 1 96. The pulse detector 1 84, write precompensation circuit 1 86, PLL 1 88, frequency synthesizer 
190 and servo burst peak detector 192 are ait preferably contained within a single VLSI integrated circuit 

25 package, such as a type DP8491 made by National Semiconductor Corporation, or equivalent. 

A microcontroller address/data bus 198 and a high order address bus 200 extend directly from the micro- 
controller 196 to a microcontroller interface 202 which directly communicates with the sequencer 100, enco- 
der/decoder 1104, motor control 168. servo control circuit 180 and a buffer memory controller 210. As 
mentioned, these circuits are preferably, although not necessarily, embodied within a single VLSI application 

30 specific CMOS circuit, as suggested by the single block for these circuits in Fig 6A&6B. An address bus 204 
extends from the microcontroller interface 202 to a program memory 206. Certain of the address bit positions 
for addressing the program memory 206 are generated and put out directly to the program memory 206 by the 
microcontroller 196 over an address bus 208. 

The buffer memory confroller 210 generates buffer addresses and puts them out over a bus 212 to a buffer 

35 memory 214 which may preferably be configured as a data cache. Data blocks are transferred to and from the 
FIFO register 108 within the sequencer 100 and the buffer memory 214 via a buffer data bus 216. This bus 
216 also extends to an interface control circuit 218 which receives data blocks and command values via an 
interface bus 220 connected to a host system and stores the values and blocks in the memory 214. and which 
retums status values and data blocks from the memory 214 via the interface bus 220 to the host The interface 

40 circuit may implement industry standard host-subsystem bus level interfaces, such as the SCSI interface stan- 
dard, or it may implement the IDE interface standard (IBM AT bus), as is conventionally known in the art. For 
example, an NCR type 5380 SCSI interface controller is a suitable interface control circuit 21 8 for implementing 
the SCSI bus convention standard. 

A direct memory access path exists between the microcontroller 196 and the buffer memory 214 via the 

45 micro interface 202 and the buffer control 210. With this path, it is practical for the microcontroller to access 
particular byte locations of the buffer memory 214 in order to withdraw an erroneous byte from an error location. 
XOR the enroneous byte with a correction value, and write the correction value to the error location, in order 
to implement an on-the-fly error correction method of the present invention. 

As noted repeatedly hereinabove, the disk drive 160 is a presently preferred subsystem or environment 

50 for utilization of the on-the-fly error correction techniques of the present invention. As will be appreciated by 
those skilled in the art many of the low level repetitive syndrome generating and checking tasks are carried 
out in the syndrome generator hardware shown in Figs 4A&4B and described in conjunction therewith. The high 
level task of calculating error correction values is carried out as needed by the microcontroller 196 in accor- 
dance with an error correction service routine program as outlined in this specification 

55 Preferably although not necessarily, the microcontroller 196 is a single monolithic microcontroller such as 

the NEC 78322. or equivalent, operating in a two-phase time divided arrangement per servo sector wherein a 
first time interval upon the arrival of each servo sector is devoted to servo control operations for head positio- 
ning, with a second and following time interval devoted to other tasks, including error conrection operations, for 
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example. An overview of this form of disk drive architecture is provided in commonly assigned U.S. Patent 
No.4,669,004 to Moon et a!., the disclosure of which is hereby incorporated by reference. A hierarchical system 
for managing the tasks performed by the microcontroller 196 during the second and following time interval is 
disclosed in commonly assigned U.S. Patent Application Serial No. 07/192,353. filed on May 10, 1988, now 
5 U.S. Patent No. 5.005.089 the disclosure of which is hereby incorporated by reference. The error connection 
functions of the exemplary microcontroller 196 could readily be carried out by one or more microcontrollers in 
a multiple microcontroller architecture, with improved real time performance, but at higher cost of implementa- 
tion. 

In overview, the error correction process for a data-read-from-disk operation includes during the data rea- 
10 ding back process (and after each data block has had syndrome bytes and cross check bytes appended thereto 
during a previous data writing operation) the following steps: 

1 . Preset the shift registers of the syndrome generator 10 to a predetermined known state, which may be 
a recurrent pattem of arbitrarily ordered ones and zeros. 

2. Generate syndrome remainder bytes and cross check syndrome remainder bytes for a data block (data 
15 sector) from the stream of data bytes read firom the disk by passing those bytes through the syndrome gene- 
rator 10. 

3. Check to see if the remainder bytes for each interleave are equal to zero as the last byte of each interleave 
is clocked into the syndrome generator 10. 

a. If the remainder bytes for each interieave are equal to zero, wait until the next block or sector, as no 
20 correction is required. 

b. If one or more remainder bytes for an interleave are not equal to zero and if the syndrome latch is 
available to hold the syndrome, proceed to step 4. ^ 

c. If the remainder bytes are not equal to zero and if the syndrome latch is not presently available to 
hold the syndrome (meaning that remainder bytes from the last data block remains latched therein 

25 during an already-commenced on-the-fly error correction operation), stop data flow; on-the-fly error 

correction cannot continue to be performed. Jfe 

4. Latch the remainder bytes for each interieave into the syndrome latch 68. These remainder bytes have imi 
the form RO. R1, R2 and R3 (for each interieave) wherein: ^ 

R(x) = R3x3 + R2x2 + R1x + RO (30) ^ 
30 After the remainder bytes Ri for a present data block are latched into the syndrome latch 68. the syndrome ^ 

generator 1 0 then continues to decode remainder bytes for the next data block. A pair of non-zero remainder - fe- 
bytes from adjacent data blocks will effectively shut down the presently described implementation of on- & 
the-fly error correction process as the microcontroller 196 presently employed lacks sufficient random r 
access memory space to receive and store remainder bytes from successive data blocks on-the-fly. Howe- ^ 
35 ver, there is no reason that increased random access memory space may be used to hold these bytes and n 

thereby enable continuing on-the-fly error correction for multiple successive data blocks. The potential for x - 
shut-down is also dependent upon the time required by the microcontroller 196 to correct the first enror and ^ 
test the conrection with the cross-chedc syndrome. A further limitation resides in the size of the block buffer 
memory 214. Also, the syndrome generator 10 may be modified by provision of additional circuitry for direc- 
40 tly calculating and putting out partial syndrome bytes to the microcontroller 196 via the syndrome latch in 

order to speed up enror correction. 

5. Cause the microcontroller 196 to call and execute the error correction service routine and begins exe- 
cuting that routine. As Is known by those skilled In the art, this step may be implemented through interrupt 
or polling techniques. 

45 6. Transfer remainder bytes from the remainder latch 68 to the microcontroller 196 on an as needed basis 

during execution of the error correction algorithm. The firmware then converts remainder bytes R(x) to par- 
tial syndrome bytes S(x) if any of the remainder bytes are unequal to zero, in accordance with: 

Si = R(x)mod(x -*- alpha* .5 . 

alphai* ^ \ ' 

50 where i = 0. 1 , 2 and 3 

[wherein the plus symbol in the exponent indicates arithmetic addition, rather than an exclusive-OR ope- 
ration] so that partial syndromes SO, SI. S2 and S3 are obtained. We must go through this conversion pro- 
cess because the syndrome generator circuit 1 0 is also used during data readback to provide th remaind r 
bytes Ri in order to more optimally minimize the amount of required hardware. 

55 7. For each interieave. convert the remainder bytes to partial syndrome bytes within an interieave of three: 

50 = Data modulo (X + alpha^26) 

51 = Data modulo (X + alpha^zT) 

52 = Data modulo (X + alpha^^s) 
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S3 = Data modulo (X + alpha^^S). 

a. On-The-Fly Single Burst Error Correction. For on-the-fly single-burst error correction, let L1 equal 
the enror location and e1 equal the error value. 

(1) If (SO = 0), or (SI = 0) or (S2 = 0) or (S3 = 0), then go to the double burst error conrection algorithm. 
5 (2) If (S1/S0 = S2/S1 = S3/S2) then continue, otherwise go to the double burst error correction algo- 
rithm. 

(3) Perform: 

L1 = Log(S1/S0] 

and 

10 e1 = S1 (S0/S1)127 

(4) If LI is outside of a predetemnlned range, (i.e. 0-1 75 for interleave 1 ; 0-174 for interleaves 2 and 
3), then go to double burst correction, else proceed to step 8 below. 

b. Double Burst Error Conrection. For double burst error correction, let LI and L2 equal the error loca- 
tions; and, let e1 and e2 equal the error values at the respective locations. 

15 (1) Calculate phi 1 and phi 2 in accordance with equations (16) and (17) above. 

(2) If the denominator or numerator of phi 1 or phi 2 is equal to zero, then flag an uncorrectable error 
to the host. 

(3) Find the roots of the quadratic equation (18) given above and output the resultant XI and X2 
values. 

20 (4) If phi (x) has no solution, flag an uncorrectable error to the host. 

(5) Calculate error locations L1 and L2 in accordance with equations (19) and (20) above. If either 
LI or L2 has a value located beyond the predetermined permissible range of values, then flag an 
uncorrectable error to the host 

(6) Calculate error valus el and e2 in accordance with the equations (21) apd (22) given above. If 
25 eithr el or e2 equals zero, then flag an uncorrectable error to the host, else proceed to step 8 below. 

8. The microcontroller 196 then computes the effect of the changed byte upon the cross check partial syn- 
drome and saves this computed value and repeats this process for each of the other two interleaves of the 
data block. 

9. If the correction passes the cross check checking step 8, the microcontroller 1 96 then corrects the error 
30 using its DMA channel to the buffer memory 214 leading through the micro interface 202 and buffer control 

210 by withdrawing the enroneous byte from the enror location within the block. XORing the enroneous byte 
with a correction value to yield a corrected byte, and writing the corrected byte back to the error location. 

10. Taking the error correction into account: 

a. If the cross check partial syndrome is equal to zero, the error correction process is detenmined to be 
35 within the probability of miscorrection and is therefore completed; the syndrome latch 68 is released, 

and the data block in the memory buffer 214 containing the corrected error is then released for transfer 
to the host 

b. If the cross check partial syndrome is unequal to zero, on-the-fly error conrection is stopped, and step 
1 1 is performed 

40 11. One or more retries are then undertaken, until a conrection is achieved, or the enror is determined to 

be much larger than originally contemplated, in which case an unconrectable enror is flagged to the host 
When a retry is commanded by programming at the microcontroller 1 96, the retry is performed by the micro- 
controller offline in the sense that this process is carried out after data flow has been interrupted. An error 
message may be generated and sent to the host, and the retry process may continue for e.g. eight retries, 
45 as is standard at the interface. 

Thus, on-the-fly ECC occurs essentially in the background as a part of the read channel process by which 
data is transferred from the disk to the host in a manner transparent to the host machine. If the transparent 
on-the-fly ECC methods described herein are unable to correct the error or errors to a user-acceptable confi- 
dence level, then more conventional enror recovery activities and procedures may occur in the foreground at 
50 the interface with the host, as per the conventional SCSI error correction standard, for example. 

Error detection processes for checking the ID fields are carried out entirely within the sequencer 100. so 
that if an error is detected in the ID field, repeats are made until either the sector is declared to be inaccessible, 
or it is accessed. 

Having thus described an embodiment of the invention, it will now be appreciated that the objects of the 
55 invention have been fully achieved, and it will be understood by those skilled in the art that many changes in 
construction and widely varying embodiments and applications of the invention will suggest themselves without 
departing from the spirit and scope of the invention. The disclosures and the description herein are purely illus- 
trative and are not intended to be in any sense limiting. 
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Claims 

1. A Reed-Solomon error correction apparatus for performing error correction upon a data block within a 
sequence of substantially contiguous data blocks flowing from a source to a destination via a block buffer 

5 memory means for temporarily storing multiple consecutive ones of the data blocks, microcontroller means 

(196) being provided for supervising the flow of the data blocks sent via the block buffer memory means, 
characterised in that the error correction is perfonmed on-the-fly and the apparatus comprises: Galois finite 
field syndrome generator and remainder recovery circuit means (10) connected to the stream of data to 
receive and process each data block from the source for recovering plural error correction remainder bytes 

10 for said data block, said bytes being related to syndrome bytes appended to the data block incident to It 

being sent from the source, the generator and remainder recovery circuit means (10) including checking 
means (78) for comparing predetermined ones of said recovered plural error correction syndrome bytes 
with a nominal value to detenmine equivalence or non-equivalence thereof; and latch means (68) for latch- 
ing recovered partial error correction remainder bytes in response to a non-equivalence as determined by 

15 said checking means (78); the microcontroller means (196) being responsive to a said non-equivalence 

to execute error correction sen/ice routine firmware means and thereupon have direct access to the latch 
means (68) for selectively obtaining the recovered error correction remainder bytes to calculate at least 
one error location and a corresponding error value from the obtained regenerated error correction remain- 
der bytes to generate corrected data to replace data of the block determined to be erroneous, and to control 

20 direct access means for directly accessing the buffer memory means for substituting the corrected data 

for the erroneous data of a block stored therein before it is transferred to the destination. 

€5- 

2. A Reed-Solomon error correction apparatus according to claim 1, wherein the generator and remainder - 
recovery circuit means (10) comprises Reed-Solomon cross check syndrome generator means for recov- 

25 ering cross checking syndrome information previously appended to each data block before it flows from 

said source and for storing said recovered cross checking syndrome information in said latch means (68), ^ 
and wherein the microcontroller means (196) upon executing the error conrection service routine firmware 
means further checks the impact of an error conrection upon the recovered cross checking syndrome infor- 
mation, 

30 ^ 

3. A Reed-Solomon error conrection apparatus according to claim 1 or 2. wherein the generator and'recovery ^ 
circuit means (10) further comprises error detection means for recovering and checking error detection 
syndrome information previously appended to a data ID field of each said data block. 

35 4. A Reed-Solomon error correction apparatus according to any one of claims 1 to 3. wherein the generator * 
and recovery circuit means (10) generates error correction coding syndrome infonmation in accordanc ^ 
with a Galois field (28) generated by a field generator polynomial having the fonm x® + x^ + x^ + 1 and 
wherein the first tenn of the field is x** x^ + x + 1 (which is 00101011 binary), each error correction syn- 
drome polynomial being of the form 

40 G(x) = x^ (alphai8),x3 + (alpha87).x2 + (alpha^8).xi + 1. 

5. A Reed-Solomon error correction apparatus according to claim 2 or 3, wherein the generator and remain- 
der recovery circuit means (1 0) comprise Reed=Solomon cross check syndrome recovery means for recov- 
ering cross checking remainder information related to cross check syndrome information in accordance 

45 with the polynomial G(x) = x^ ^ a, where a is unequal to one, appended to the data block incident to it being 

sent from the source; and the microcontroller means calculates the effect of the corrected data upon a 
cross check remainder infonmation. 

6. A Reed-Solomon error conrection apparatus according to daim 5, wherein the Reed-Solomon cross check 
50 syndrome information is in accordance with the polynomial G(x) = x^ + alphas 

7. A Reed-Solomon error correction apparatus according to claim 3, wherein the error detection information 
appended to said data ID field is in accordance with a two byte syndrome in accordance with the polyno- 
mial: 

55 Gedc2(x) = x2 + ax + b, 

where b is unequal to one, 

8. A Reed-Solomon error correction apparatus according to claim 7. wherein the error detection means gen- 
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erates a two byte syndrome in accordance with the polynomial: 

Gedc2(x) = x2 + X + alpha^®. 

9. A Reed-Solomon error correction apparatus according to claim 7, wherein the generator and recovery clr- 
5 cuit means further comprises error detection means for detecting an error in a data ID field of each said 

data block by generating a two byte syndrome in accordance with the polynomial: 

Gedc2(x) = x2 + alphas^x + alpha^i. 

10. A Reed-Solomon error correction apparatus according to claim 3, wherein the error detection information 
10 appended to said ID field is in accordance with a three byte syndrome in accordance with the polynomial: 

Gedc3(x) = x3 + ax2 + bx + c. 

11. A Reed-Solomon error correction apparatus according to claim 10. wherein the enror detection means gen- 
erates a three byte synndrome of the form: 

15 Gedc3(x) = x3 + alphae7.x2 + alpha^^.x + 1 

12. A Reed-Solomon error conrection apparatus according to claim 10. wherein the generator and recovery 
circuit means further comprises error detection means for detecting an error in a data ID field of the data 
block by generating a three byte syndrome of the form: 

20 Gedc3(x) = x3 + alpha8^x2 + alpha^^.x + alpha^s. 

13. A Reed-Solomon error correction apparatus according to any preceding claim for performing on-the-fly 
error correction within a disk drive data storage subsystem (160) including a source in the form of at least 
one rotating data storage disk (162). at least one radially positionable data transducer (170) for writing 

25 data to the disk and for reading data from the disk, and sequencer means (1 00) for sequencing data blocks 

from the block buffer memory means (214) serially onto the disk (162) during data writing operations and 
for sequencing serial data into data blocks for delivery to the block buffer memory means (21 4) during data 
reading operations, the block buffer memory means (214) being capable of storing a plurality of said 
blocks, the microprocessor means (196) supervising head positioner means (172) for positioning the data 

30 transducer head (1 70) at a selected data track location and for supervising transfer of data to a destination 

in the fonm of a host computer via the data sequencer means (100). the block buffer memory means (214) 
and an interface to the host. In which disk drive data storage subsystem: the Galois field syndrome 
generator and recovery circuit means (1 0) is connected to the sequencer means (1 00) to receive and pro- 
cess each data block for generating plural partial error correction syndrome bytes for said data block, said 

35 bytes being appended to the data block incident to it being written to the disk (162) during data writing, 

said generator and recovery circuit means (10) detecting errors by recovering plural error conrection 
remainder bytes from each data block read from the disk (162). 

14. An apparatus according to claim 13. wherein the programmed microprocessor means (196) comprises a 
40 single microprocessor (196). 

15. An apparatus according to claim 14, wherein the programmed microprocessor (196) divides its functional 
activities between head position control and supervision and data flow supervision including execution of 
said error conrection service routine. 

45 

16. A Reed-Solomon error correction code syndrome generator apparatus for use in a data block transfer path 
within a computing system, wherein each data block being transferred includes Reed-Solomon error cor- 
rection coding syndrome information appended at the end thereof for enabling detection and conrection 
of errors within data of the block, error correction code syndrome generator means calculate error correc- 

50 tion coding syndrome infonmation in accordance with a Galois field (2^) generated by a field generator 

polynomial having the fonm x^ + x* + x^ + x^ + 1 and wherein the first term of the field is x^ + x^ + x + 1 
(which is 00101011 binary) and each error correction syndrome polynomial being of the form G(x) = x^ + 
(alpha^®).x3 + (alphaS7).x2 + (alphai8).xi + 1 and characterised in that Reed-Solomon cross checking syn- 
drome information is appended to each data block wherein the error conrection coding syndrome infor- 

55 mation is derived further in relation to the cross checking syndrome information of the block and a cross 

check syndrome generator means calculates cross checking syndrome information from the polynomial 
of the form G(x) = x^ + b, where b is unequal to one. 

20 
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17. An apparatus according to claim 16, wherein the cross check syndrome generator means calculates cross 
checking syndrome information from the polynomial of the form G(x) = + alphas 

18. An apparatus according to daim 17. wherein the cross check syndrome generator means comprises an 
5 input (12) for receiving each incoming data block as a clocked data stream of serial bytes, first summing 

junction means (84) for summing each byte of the incoming data block clocked data stream with a feedback 
byte to produce a sum byte, first multiplier means (86) for multiplying the sum byte by alpha^ to yield a 
first product byte, first clocked latch stage means (90a) connected to the first multiplier means (86) for stor- 
ing the first product byte during a first byte clock period and for storing subsequent product bytes during 
10 subsequent byte clock periods, second clocked latch stage means (90b) connected to the first clocked 

latch stage means (90a) for storing the first product byte during a second byte clock period, the second 
clocked latch stage means (90b) for feeding back the first product byte during the second byte clock period 
as the feedback byte to the summing junction (84). 

15 19. A Reed-Solomon error correction code syndrome generator apparatus according to any one of claims 16 
to 18, wherein the error correction code syndrome generator means: operates on-the-f!y; includes a 
remainder latch means (68) ; and includes means for generating an error detection syndrome for a data 
ID field of the data block. 



20 20. An apparatus according to claim 19. wherein the means for generating an error detection syndrome for a 
data ID field of the data block generates a two byte syndrome of the form: 

Gedc2(x) = x2 + ax + b, ^ 

where b is unequal to one. 

25 21. An apparatus according to claim 20. wherein the means for generating an error detection syndrome gen- 

erates a two byte syndrome of the form: m 

Gedc2(x) = x2 + X + alpha^Q. sfe 

22. An apparatus according to claim 20, wherein the means for generating an error detection syndrome for a 

30 data ID field of the data block generates a two byte syndrome of the form: ^ 

Gedc2W = x2 + alpha^^.x + alpha8^ <S? 

23. An apparatus according to claim 19. wherein the means for generating an error detection syndrome for a ^ 
data ID field of the data block generates a three byte syndrome of the form: a 

35 Gedc3(x) = x^ + ax2 + bx + c :t 

24. An apparatus according to claim 23, wherein the means for generating an error detection syndrome for a * 
data ID field of the data block generates a three byte syndrome of the form: 

Gedc3(x) = x^ + alpha87.x2 + alphas^.x +1. 



40 



25. An apparatus according to claim 23, wherein the means for generating an error detection syndrome for a 
data ID field of the data block generates a three byte syndrome of the form: 

Gedc3(x) = x3 + alpha»^x2 + alpha^^.x + alpha^^. 



45 26. Within a data block decoding process, an on-the-fly error correction method for conrecting a said data block 
within a stream of substantially contiguous data blocks, each block having error syndrome information cal- 
culated and appended in accordance with a predetermined Reed-Solomon code during an encoding pro- 
cess occurring before the data decoding process, comprising the steps of: passing the block of data 
through a syndrome information recovery circuit (10) to obtain recovered information related to the error 

50 syndrome information, comparing the recovered information with zero to determine if an error burst is pre- 

sent within the data block, and if an error burst is determined to be present: latching the recovered infor- 
mation into a syndrome latch (68) in order to free the syndrome infomnation recovery circuit (10) for the 
next data block of the stream, causing a microcontroller means (196) to call and execute an error correction 
service routine firmware means and thereupon: selectively transferring bytes of the recovered information 

55 from the syndrome latch (68) to the microcontroller means (196). detenmining with th microcontroller 

means (1 96) and the firmware m ans the location and a corrected value for the error burst in accordance 
with a predetermined Reed-Solomon burst error correction algorithm implemented within the microcon- 
troller means (196) by the firmware means, and replacing the corrected value for the enror burst within the 

21 
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block detected to contain the error burst before it is delivered to a host system. 

27. The method according to claim 26. wherein the data block further includes Reed-Solomon cross check 
information calculated and appended thereto during the encoding process, the method further comprising 
5 the steps of latching recovered cross check remainder information into the remainder latch (68). converting 

the recovered cross check remainder information into recovered cross check partial syndrome information, 
and determining with the microcontroller means (li96) as enabled by the fimnware means the effect of the 
correct value upon the recovered cross check partial syndrome information to detect possible miscorrec- 
tion after the correct value has been determined. 

10 
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FIG. -2 
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@ On-the-fly error correction with embedded digital controller. 



(g) An on-the-fiy enror conrection method for cor- 
recting a data block within a 'stream of substan- 
tially contiguous data bloclcs {during a data 
decoding process, each block having error syn- 
drome information calculated and appended in 
accordance with a predetermined Reed . Solo- 
mon code during an encoding process- occur- 
ring before the data decoding process, 
comprising the steps of: 

passing the block of data through a syn- 
drome information recovery circuit to obtain 
recovered information related to the enror syrv 
drome information, 

comparing the recovered Information with 
zero to determine if an error burst is present 
within the data block, and 

if an error burst is determined to be pre- 
sent : 

latching the recovered infonmatton into a 
remainder latch in order to free the syndrome 
information recovery circuit for the next data 
block of the stream, 

causing a microcontroller to call and exec- 
ute an enror conrection service routine and 
thereupon : 

selectively transferring bytes of the recov- 
ered information from the remainder latch to 
the microcontroller, 

determining with the microcontroller the 
location and a corrected value for the error 
burst in accordance with a predetermined Reed 
Solomon error conrection algorithm implemen- 
ted within the microcontroller by the service 
routine, and 

substituting the corrected value for the 
error burst within the block detected to contain 
the error burst before it is delivered to a host 



system. Cross checking the correction, and 
error detection of data block ID fields is also 
disclosed. 
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